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Evidence for Merger-Driven Activity in the Clustering of 
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ABSTRACT 

Recently, a very large clustering length has been measured for quasars at a redshift 
of z ~ 4. In combination with the observed quasar luminosity function we assess the 
implications of this clustering for the relationship between quasar luminosity and dark 
matter halo mass. Our analysis allows for non-linearity and finite scatter in the relation 
between quasar luminosity and halo mass, as well as a luminosity dependent quasar 
lifetime. The additional novel ingredient in our modelling is the allowance for an excess 
in the observed bias over the underlying halo bias owing to the merger driven nature 
of quasar activity. We find that the observations of clustering and luminosity function 
can be explained only if both of the following conditions hold: (i) The luminosity to 
halo mass ratio increases with halo mass; (ii) The observed clustering amplitude is in 
excess of that expected solely from halo bias. The latter result is statistically significant 
at the 99% level. Taken together, the observations provide compelling evidence for 
merger driven quasar activity, with a black-hole growth that is limited by feedback. In 
difference from previous analyses, we show that there could be scatter in the luminosity 
halo mass relation of up to 1 dex, and that quasar clustering can not be used to 
estimate the quasar lifetime. 

Key words: cosmology: large scale structure, theory - quasars: general 
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1 INTRODUCTION 

The Sloan Digital Sky Survey (SDSS; York et al. 2000) and 
the 2dF quasar redshift survey (Croom et al. 2001a) have 
measured redshifts for large samples of quasars, and deter- 
mined their luminosity function over a broad section of cos- 
mic history (Boyle et al. 2000; Richards et al. 2006). These 
surveys have also been used to constrain the clustering prop- 
erties of quasars (e.g. Croom et al. 2001b; Croom et al. 2002; 
Croom et al. 2005; Porciani & Norberg 2006; da Angela et 
al. 2008; Padmanabhan et al. 2008), including the variation 
of clustering length with redshift and luminosity. In the local 
Universe quasars have clustering statistics similar to opti- 
cally selected galaxies, with a clustering length Ro ~ 8Mpc. 
The clustering length increases towards high redshift, but is 
only weakly dependent on luminosity. This has been inter- 
preted as evidence for a model in which the quasar luminos- 
ity takes on a broad range of values at different stages of its 
evolution (Lidz et al. 2006). 

In addition to the large quasar samples below z ~ 3, the 
Sloan Digital Sky Survey (SDSS) has discovered luminous 
quasars at redshifts as high as z ~ 6.4, i.e., when the uni- 
verse was less than a billion years old (Fan et al. 2001a,b; Fan 



et al. 2003; Fan et al. 2006). The supermassive black-holes 
(SMBH) powering these quasars have masses of > 10 9 A'/ Q . 
Recently, clustering measurements of quasars have been ex- 
tended out to redshifts beyond z ~ 4 using the Fifth Data 
release of the SDSS (Shen et al. 2007). Quasars are shown 
to be significantly more clustered at high redshift relative 
to the more local samples, with values of Ro ~ 12Mpc 
and Ro ~ 17Mpc at z ~ 3 and z > 3.5 respectively 
(Shen et al. 2007). White, Martini & Cohn (2008) have re- 
cently shown that this large clustering length can be associ- 
ated with the observation of a very large clustering bias of 
b = 14.2 ± 1.4 for quasars at z ~ 4. 

It was argued by Martini & Weinberg (2001) and 
Haiman & Hui (2001), that the quasar correlation length 
can be used to infer the typical mass of dark matter ha- 
los in which quasars reside. One may then derive the quasar 
duty-cycle by comparing the number density of quasars with 
the density of host dark matter halos. The quasar lifetime 
follows from the product of the duty-cycle and the time 
that the dark-matter halo spends in between major merg- 
ers (although there is a degeneracy between the lifetime and 
the quasar occupation fraction or beaming). Results from 
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low redshift clustering have suggested quasar lifetimes of 
t q ~ 10 6 -10 7 years, consistent with the values determined 
by other methods (see Martini 2003 for a review). More re- 
cently, Shen et al. (2007) have applied this analysis to their 
measurements of quasar clustering at high redshift. They 
infer lifetimes of t q ~ 3 x 10 7 -6 x 10 s years for quasars at 
z > 3.5. 

Using techniques similar to those employed at low red- 
shift, this result has been used by White et al. (2008) to 
argue that the scatter in the relation between quasar lu- 
minosity and halo mass must be smaller than 0.3dex. This 
small scatter poses a problem for theories of quasar growth 
and formation. The tightest local relation is observed be- 
tween the black hole mass and bulge velocity dispersion, 
also with a scatter of ~ 0.3 dex. However, the relation be- 
tween quasar luminosity and halo mass must have several 
additional sources of scatter, including in the relation be- 
tween halo mass and velocity dispersion, and between black 
hole mass and quasar luminosity. It is therefore difficult to 
understand how the luminosity - halo mass relation could be 
as tight as the black-hole mass - velocity dispersion relation. 

The conclusion of White et al. (2008) arises because 
an increased scatter would tend to bring large numbers of 
low bias halos (which are more common) into a sample of 
halos at fixed quasar luminosity. The required small scat- 
ter therefore depends crucially on the assumption that the 
value of the observed bias truly reflects the actual bias of 
the quasar host galaxies. However additional bias may exist 
beyond the usual halo bias for systems which are triggered 
by mergers, although the magnitude (and even the sign) of 
the effect are still being debated (e.g., Kolatt et al. 1999; 
Gottlober et al. 2002; Kauffmann & Haehnelt 2002; Percival 
et al. 2003; Scannapieco & Thacker 2003; Gao, Springel & 
White 2005; Wetzel et al. 2007). Interpreting the results of 
simulations of the effect of mergers on bias is complicated 
by the small range of redshift and mass available in differ- 
ent studies. Furlanetto & Kamionkowski (2006) attempted 
to describe the clustering bias of merging systems more gen- 
erally based on an analytic model. They concluded that a 
peak-background split approach within the extended Prcss- 
Schechter formalism cannot be used to calculate the merger 
bias because the large scale density field does not enter the 
calculation of merger rates using this approach. Instead, 
Furlanetto & Kamionkowski (2006) calculated the merger 
bias in models where the merger rate per halo is assumed to 
scale with the number densities of neighboring halos. Since 
this model does not account for the large scale density in the 
merger rate itself (but only in the halo densities), Furlanetto 
& Kamionkowski (2006) argue that it can only be consid- 
ered qualitative. However, the model suggests that merging 
halos are significantly more clustered than isolated halos, by 
a factor of ~ 1.5 for massive halos at z ~ 3. Furlanetto & 
Kamionkowski (2006) also proposed a simple model in which 
close pairs are assumed to be merging systems, and compute 
the bias of those based on probabilities of separation based 
on clustering statistics in a variety of models, with similar 
results. 

The possibility that the observed bias exceeds the av- 
erage halo bias at a particular halo mass is not restricted to 
scenarios where observed objects are merger driven. A sep- 
arate, but related issue concerns halo formation bias. Here 
sub-samples of dark matter halos of fixed mass and redshift 



are shown to have clustering statistics that depend on their 
formation history (Gao, Springel & White 2005; Croton, Gao 
& White 2007; Wetzel et al. 2007). For example, if the ex- 
istence of luminous quasars required that the host halo be 
older than a particular minimum age, then this sub-sample 
of halos would have a clustering bias in excess of the halo 
population as a whole. 

Since luminous quasars play an important role in the 
evolution of massive galaxies at high redshifts, a vari- 
ety of models have been proposed to explain the high 
redshift luminosity function of quasars (e.g. Haiman & 
Loeb 1998; Haehnelt, Natarajan & Rees 1998; Kauffmann 
& Haehnelt 2000; Volonteri, Haardt & Madau 2003; Wyithe 
& Loeb 2003; Di Matteo, Springel & Hernquist 2005; Hop- 
kins & Hernquist 2008; Hopkins et al. 2008; Li et al. 2007). 
The majority of these models assume that major mergers 
drive the quasar activity, implying that the observed bias 
should not simply reflect the bias based on the host halo 
mass. 

In this paper we investigate the evidence for an addi- 
tional bias, beyond the spatial clustering of dark matter ha- 
los, in the clustering data of high redshift quasars. Wyithe & 
Padmanabhan (2006) showed that different models are able 
to reproduce the data on the high redshift quasar luminosity 
function, implying a degeneracy among their input physical 
parameters. For this reason, we will not attempt to model 
physical processes such as SMBH growth and feedback, but 
instead adopt the more general approach of parameterising 
relations like the correlation between SMBH mass and halo 
mass. We are interested in the full range of allowed parame- 
ters as well as in which sets of parameters can be excluded, 
rather than in a particular set of parameters that is able to 
describe the data. This approach allows us to isolate the re- 
lationships that are constrained by the data, and to provide 
robust observational input for future theoretical modeling. 
In particular, the contribution of mergers to the observed 
clustering bias could be large, although its value is theoret- 
ically uncertain. We therefore treat this contribution as a 
free parameter, and attempt to constrain its value. 

In our numerical examples we adopt the standard set of 
cosmological parameters (Komatsu et al. 2008), with values 
of fi m = 0.24, fib = 0.046 and SIq = 0.72 for the matter, 
baryon, and dark energy fractional density respectively, and 
h = 0.70, for the dimensionless Hubble constant. We assume 
us = 0.82 for the rms amplitude of the density field fluctu- 
ations within spheres of radius 8/i _1 Mpc linearly extrapo- 
lated to z = 0, and a power-law slope for the primordial 
density power-spectrum of n B = 0.96. 



2 MODEL 

Quasars are known to have a finite distribution of Edding- 
ton ratios (Kollmeier et al. 2006), rj = l/L^dd, where I 
and Z/Edd are the quasar and Eddington luminosities re- 
spectively, and are thought to have a light-curve that varies 
across a large range of luminosity during the quasar lifetime 
(Hopkins & Hernquist 2008) . We can express this lightcurvc 
as l(t, Mbh) oc rj(t)Mbh, where Mbh is the black hole mass 
and r] is a dimensionless function of time. Consider now a 
population of quasars at a fixed instant in time. We expect 
to observe a range of quasar luminosity I at fixed host halo 
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mass M because different quasars will be at a different phase 
of their light curve. We first compute the distribution of lu- 
minosity I at fixed halo mass and fixed black hole mass 



dP_ 

dl 



dt dP D 



M,M bh 



dl dt 



d v \ e(tit) 



(i) 



Here, (dP pr ior/dt) — S(tjt)/tit (where O is the Heaviside 
Step function), is constant during the quasar lifetime (0 < 
t < tit), and zero at other times. The distribution of I at 
fixed halo mass is then 



dP 
dl 



— oc / dM] 



dP 



dMh 



dP 

~dT 



(2) 



M,M hb 



where 



is the probability distribution for the black- 



dM hh | M 

hole mass which depends on halo mass. The mean of the 
logarithm of the luminosity at halo mass M can now be 
calculated from 



(IogL)(Af)= / d log Z log/ 



dP 



dlog I 



while the variance (in dex) can be calculated as 



A 2 (M) = J dlogZ(logZ-logi) 2 



dP 



dlog I 



(3) 



(4) 



In this paper we do not use the above equations to com- 
pute the relation between the quasar luminosity, L, and host 
halo mass, M. Rather, we suggest a parametrised form based 
on the above definitions and constrain the parameters of 
the L-M relation using the available data for high redshift 
quasars. We begin by parameterising the relation between 
the mean quasar luminosity, L, and host halo mass, M, as 



L 



M 
Mo 



(5) 



where Lo and Mo are normalisation constants. We then sup- 
pose that this relation has an intrinsic scatter in luminosity 
I at fixed halo mass M of A. For simplicity, we assume that 
the distribution dP/dl\M is log-normal and express its scat- 
ter in dex. Re-writing the mean relation as 



M = U, ( 

Lo 



(6) 



we then specify the scatter in the M-L relation in terms of a 
probability distribution for halo mass m at fixed luminosity 
L i.e. 



dP 



d log m 



277(A/ 7 ) 



exp 



(logm - logM) 2 



2(A/ 7 ) 2 



(7) 



We next specify a model for the luminosity function of 
quasars (i.e. the number density of quasars per unit lumi- 
nosity with luminosity L) 



dn f dn dm 

dL I ^ dm dL 



dP 



L,M 



dlogm 



/duty 



(8) 

where is the Sheth-Tormen (1999) mass-function for 
dark matter halos of mass M, and the derivative 



dm 
~dL 



(9) 



m Mo 1 f L 
M Lo~7 \L~o~ 

The duty-cycle /duty = tit/ is, where tn is the Hubble time 



dM dm 
~dLdM 



at redshift z, is the fraction of halos that have a quasar in a 
luminous phase, and the last factor in equation JH} accounts 
for the possibility that the quasar lifetime tn is dependent on 
black hole mas^J. The luminosity function has an associated 
logarithmic slope 



(3{L) 



d ( dn 

log — 

dlogL \ dL 



(10) 



We also compute the density of quasars above a limiting 
luminosity L, 

The halo bias of quasars with luminosity L is given by, 



b(L) = 



J d\ogmb(m 



dP 



dn 

dm d log m 



J t/log? 



dP 



f dn 

J dm d log m 



(12) 



where b(m) is the halo bias of a halo with mass m, which 
is computed using the fitting formula of Sheth, Mo & Tor- 
men (1999). This bias can be used to calculate the halo bias 
of a sample of quasars with luminosities above a limiting 
value. 



(b) 



JTdL^) 

SI 



dL' -S. 

dL' 



(13) 



As noted in §1, this average halo bias may underestimate 
the observed quasar bias if quasars are triggered by galaxy 
mergers (Furlanetto & Kamionkowski 2006). To account for 
this possibility, we introduce a free parameter F, defined as 
the ratio of the observed bias b b s to the halo bias (b), 



feobs 

w 



(14) 



Note that the value of the black-hole mass does not en- 
ter our calculations. Thus, our conclusions are insensitive 
to assumptions regarding the Eddington ratio and the re- 
lation between black-hole mass and halo mass. Conversely, 
this means that our model is not able to address issues sur- 
rounding these relations. Nevertheless, as we show below, 
this simplicity does allow our model to reach some strong 
conclusions regarding the relationship between the observed 
quasar clustering and the host halo mass. 

Before proceeding to analyse the implications of exist- 
ing observational data, we note that the detailed results are 
sensitive to the form of the halo mass function. Our fiducial 
calculations use the Sheth-Tormen form. However in § !3.5l we 
also present results using the corresponding Press-Schechter 
formulation to assess the level of theoretical uncertainty in 
the results. 



In this paper we only consider the luminosity function and its 
slope at a single value of luminosity. While we have quoted a 
power-law form for the dependence of duty cycle on luminosity 
we note that this should be thought of as a logarithmic slope 
at luminosity L. In general the value of a must be luminosity 
dependent or else the duty-cycle would exceed unity for some 
values of luminosity. 
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3 COMPARISON WITH OBSERVATIONS 

The quantities n(> L), (5 and b h s are observed proper- 
ties of the quasar population, with values of n(> L) ~ 
0.7 x 10" 7 Mpc _1 , /3 = -2.58 ± 0.23 and 6 obs = 14.2 ± 1.4, 
respectively. The quasar sample from which the clustering 
and corresponding density data were obtained was described 
in Shen et al. (2007). The bias and density for quasars at 
z ~ 4 were discussed for this sample in White, Martini & 
Cohn (2008), and we use their calculated bias in this paper. 
The luminosity function slope is taken from the quasar lu- 
minosity function analysis of Fan et al. (2001b), for which 
the mean quasar redshift and absolute AB magnitude at 
1450Awere z = 4.3 and M1450 fti —26, respectively. These 
quantities can be compared to the theoretical expectations 
described above, and hence used to constrain the free pa- 
rameters, /duty, a, A, 7 and F. 



3.1 Results 

We begin by illustrating the dependencies of the observ- 
ables on the available free parameters. In Figures Q] and [2] 
we show contours of constant {b)/F in the plane of /duty 
and A. Since the halo bias is related to the observed bias 
through an unknown constant F, we begin by showing two 
cases for illustration. In the first we choose F = 1, which is 
the standard model employed to link clustering statistics to 
halo mass and quasar lifetime in previous studies (Martini 
& Weinberg 2001; Haiman & Hui 2001; White, Martini & 
Cohn 2008). In the second, we adopt F = 1.5, which is ap- 
proximately the factor by which the observed bias exceeds 
the halo bias in the models of Furlanetto & Kamionkowski 
(2006). In Figures [T] and [2] the solid and dashed contours 
are plotted for values of F = 1.0 and F = 1.5, respec- 
tively. We find that there are no combinations of /duty and 
A that produce the best fit value of the observed bias for 
F — 1 (see also White, Martini & Cohn 2008) . In this case 
we draw a contour (solid line) at the 2-a lower limit of 
(b) = (14.2 ± 1.4) IF = 14.2 - 2.8 = 11.4. In the second 
case with F = 1.5, the contours (dashed lines) are drawn to 
correspond to the observed bias (b) — (14.2 ± 1A)/F. 

In constructing these contours, for each combination of 
/duty and A the luminosity L is chosen so that the luminosity 
function reproduces the observed density of quasars. The 
duty cycle on the :r-axis corresponds to quasars with this 
luminosity L. For fixed values of F, 7 and a, the contours 
in Figures [T] and [2] enclose the acceptable combinations of 
/duty and A, given the constraint of the observed bias. We 
find that smaller values of duty-cycle permit smaller values 
of intrinsic scatter in the L — M relation. The two figures 
separate the effect of the parameters a and 7 on the derived 
constraints. In Figure [1] we set a — and vary the value 
of 7. In Figure [2] we fix the value of 7 = 1 and vary a. 
The observed bias tightly constrains the scatter in the L-M 
relation (White et al. 2008). In particular, if F = 1, 7 = 1, 
the constraints show that A < 0.1 at the 2-a level, which 
is tighter than the local relation between black-hole mass 
and velocity dispersion. The clustering and density data are 
not consistent with F = 1 at the 1-sigma level for any of the 
parameter sets shown. Since the scatter A must be finite, the 
contours therefore suggest that F > 1, even at the highest 
possible duty-cycle. 



In addition to the observed bias, our model is con- 
strained by the observed luminosity function slope. In Fig- 
ures [T] and [2] we also show the region of parameter space 
that is consistent with the constraint from the observed lu- 
minosity function slope (grey scale). The predicted slope is 
sensitive to the value of 7. As a result, the values of /duty and 
A that are consistent with the observed slope are also sen- 
sitive to 7. Viable models must have overlapping regions of 
parameter space that satisfy the different constraints. Fig- 
ures 11121 show that unless 7 or a are large, there are no 
combinations of /duty and A that are even marginally con- 
sistent with the constraints of observed bias and luminosity 
function slope for F = 1. 

3.2 Joint Likelihood distributions for pairs of 
parameters 

To quantify the above conclusions we next calculate the joint 
likelihoods for pairs of parameters, and the a'posteriori prob- 
ability distributions for individual parameters. 

The likelihood for the parameter set (A, 7, /duty, F, a) 

is 



-A,7,/ duty , 



exp 



F{b) - 14.2 
L4 



+ 



0-2.58 
0.23 



(15) 



We then obtain marginalised likelihoods by integrating over 
the remaining parameters. For example, the joint likelihood 
for A and /duty is 



C 



A,/ d 



da 



dFC 



A, 7, /duty. -F> 



^ dPpTior f^-fprior dPp r i Qr (16) 



da 



d-\ 



dF 



where a m in and a ma x are the bounding values on the finite 
region of the prior probability distribution dP pT i OI /da for the 
variable a, and so on. We assume constant prior probability 
distributions for a, A and 7. For F and tduty, which are 
fractional quantities, we assume prior probability distribu- 
tions that are flat in the logarithm of these quantities. We 
assume the prior probability to be non-zero within the fol- 
lowing ranges: aw„ = -2, a max = 2; 7 min = 0, 7 max = 2.5; 

A m in — 0, A max — 2, F m in — 0.5, F m ax — 3, /duty, mill — 0.01, 

/duty.m a x = 1. While some of our quantitative results are sen- 
sitive to the values of these limits, our primary qualitative 
conclusions are robust to the choice of the prior probability 
distributions and their ranges. 

Figure [3] shows contours of the joint likelihood distribu- 
tions for /duty and F (top left panel), for 7 and F (top right 
panel), for A and a (central left panel), for F and A (central 
right panel), for F and a (lower left panel) and for 7 and a 
(lower right panel). Contours are shown at 60% and 7% of 
the maximum likelihood. Since the number of free param- 
eters exceeds the number of observables, a unique model 
cannot be found. Nevertheless, the results of Figure [3] il- 
lustrate that despite some degeneracies, several interesting 
statements can be made regarding the allowed parameter 
values. 

Our main results are as follows. First, the parameter a 
is constrained to lie within a finite range centered around 
a « —0.3. In addition, lower limits can be placed on the 
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Figure 1. Solid and dashed lines: Contours of joint likelihood for pairs of A and /duty given the observed bias. The solid and dashed 
contours correspond to values of F = 1.0 and F = 1.5, respectively. For F = 1, the contour is drawn at the 2-<r lower limit of (b) = 11.4. 
For F = 1.5, the contours are drawn at (b) = (14.2±1.4)/F. In each case the luminosity L corresponds to the observed density of quasars. 
The grey-scale shows the corresponding joint likelihood given the constraint of the observed luminosity function slope /3 = —2.58 ± 0.23. 
The dark and light grey show the regions where the predicted slope is within 1 and 2-sigma of the best fit value. Four examples are 
shown for different values of 7, with a = in all cases. 
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Figure 3. Contours of the joint likelihood distributions for /duty and F (top left panel), for 7 and F (top right panel), for A and a 
(central left panel), for F and A (central right panel), for F and a (lower left panel) and for 7 and a (lower right panel). Contours are 
shown at 60% (solid) and 7% (dotted) of the maximum likelihood. See text for details on the assumed prior probability distributions. 



value 7, which we find to be larger than unity. The value of 
the scatter is restricted to be A < 1 dex. This constraint is 
much less stringent than the conclusion of White, Martini & 
Cohn (2008). The difference can be traced to the fact that 
White et al. (2008) assumed the halo bias to be equal to the 
observed bias. We find that degeneracies prevent the data 
from imposing an upper limit on 7 or a lower limn0 on A (al- 
though the latter cannot be negative). We find that the duty 
cycle is unconstrained by these observations. This result dif- 
fers from previous studies (e.g. Martini & Weinberg 2001) 
because the parameter F removes the direct connection be- 
tween halo bias and halo mass (and hence halo density). 
The value of F is constrained to be larger than unity. This 
is our most important conclusion, and we will return to its 
implications in § 2] 



2 Note that the value of A would be expected lie in excess of 
the scatter of ~ 0.3dex in the local relation between velocity 
dispersion and black- hole mass. Inclusion of this constraint as a 
prior on A tightens the constraints on other parameters but does 
not change our qualitative conclusions. 



3.3 Constraints on individual parameters 



By analogy with equation (|16[) , the joint likelihood function 
^A,7,/ dut ,F,a can be marginalised over all remaining param- 
eters to find the likelihood for a single variable. For example, 
the likelihood for A is 



Ca oc 



/•/duty, max f-a„, a 
/ rf/duty / 

''/duty, min ''Oiiiiii 



da dj dF 

i 1min J F min 



c 



A,y,f dllty ,F,a- 



dP HP ■ HP ■ HP 

prior u--f prior l*- 1 prior ui-i prior 



da d/duty dj dF 
The a'posteriori probability density for A becomes 



HP HP ■ 

Ui± Uii prior 

—-r- oc Ca — T7 — , 
dA dA 



(17) 



with the normalisation, 
dP 



r~ dp_ 

1a., dA 



dA = 1. 
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Figure 4. Marginalised, a'posteriori differential probability distributions for a (left) and /duty (right)- The black and grey lines correspond 
to calculations performed using the Press-Schechter and Sheth-Tormen formalisms respectively. The solid (dashed) lines do (do not) 
include the correction factor of 2 in the quasar density (see text for details). 




Figure 5. Marginalised, a'posteriori cumulative probability distributions for 7 (left) and A (right). The black and grey lines correspond 
to calculations performed using the Press-Schechter and Sheth-Tormen formalisms respectively. The solid (dashed) lines do (do not) 
include the correction factor of 2 in the quasar density. 



The corresponding a'posteriori cumulative probability is 

Figure [4] shows the differential distributions dP/da and 
dP/d/duty The data constrains a value of a « —0.4 ± 0.5, 
while the value of /duty remains unconstrained. The left- 
hand panel of Figure [5] shows the cumulative distribution 
for 7. This distribution shows that 7 must be greater than 
unity at a statistical confidence of 90%. Similarly, the right- 
hand panel shows the value of 1 minus the cumulative prob- 
ability distribution for A. At 90% confidence, the scatter 
can only be shown to be below A = 1.1 (in difference from 
White et al. 2008). Finally, Figure [6] plots the differential 
(left, dP/dF) and cumulative [right, P(< F)] distributions 
for F. The most likely value is at F ~ 1.7. The probabil- 
ity drops rapidly towards low values of F and we find that 
F > 1.1 with high statistical confidence (99%). 

As noted above, our quantitative results are sensitive 
to the prior probability distributions because the number of 
free parameters is larger than the number of constraints. In 
particular, we find that the data does not constrain the up- 
per limit of the parameters 7 or F. As a result, our quantita- 
tive results are sensitive to the values chosen for the param- 
eters of 7 m ax and F max . However, if we increase the values 
of 7max and F mlD ., permitting a greater volume of parame- 
ter space a'priori, then our constraints on the lower limits 
for 7 and F are improved. Therefore, our qualitative con- 



clusions are not sensitive to the assumed prior probability 
distribution. In summary, we find constraints on individual 
parameters of a w —0.4 ± 0.5, 7 > 1 (at a statistical confi- 
dence of 90%), A < 1.1 (90%) and F > 1.1 (99%). 



3.4 Constraints on host halo mass 

Finally, we are able to use the above constraints to assess 
the range of halo masses that are consistent with the cluster- 
ing and luminosity function data. In our formalism the halo 
mass is not a free parameter. Rather, for each parameter set 
A, 7, /duty, F, Oii there is a halo mass MA, T ,/ dut ,F,a, which 
corresponds to the observed quasar density. Since our for- 
malism includes scatter in the L—M relationship, this mass is 
defined as the mean mass M at luminosity L (equation HJ]) . 
This parameter combination has likelihood CA,-y,f duty ,F,a- 
The probability distribution for M can therefore be com- 
puted from 



x 



dP 
dM 

/•A majt /-/duty, max 

/ dA d/duty 

J&min "'/duty.min 



,in I dj J dF 



.7,/duty.-F\a " 



dP dP ■ dP ■ dP ■ dP ■ 

u*± prior l*-* prior prior U-J prior t*-* prior 



dA d/duty da dj dF 
x S(M - M Ai ,, /lalj/ , a ), (18) 
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Figure 6. Marginalised, a'posteriori differential (left) and cumulative (right) probability distributions for F. The black and grey lines 
correspond to calculations performed using the Press-Schechter and Sheth-Tormen formalisms respectively. The solid (dashed) lines do 
(do not) include the correction factor of 2 in the quasar density. 




M (solar masses) M (solar masses) 



Figure 7. Marginalised, a'posteriori differential (left) and cumulative (right) probability distributions for halo mass M . The black and 
grey lines correspond to calculations performed using the Press-Schechter and Sheth-Tormen formalisms respectively. The solid (dashed) 
lines do (do not) include the correction factor of 2 in the quasar density. 

using the corresponding Press-Schechter formulation for the 
mass function (Press-Schechter 1974) and clustering bias 
(Mo & White 1996) to assess the level of theoretical sys- 
tematic uncertainty in our conclusions. 

In addition to the adopted model for the statistics of 
the halo population, the conclusions will also be sensitive 
to the value of the measured space density of quasars. Shen 
et al. (2007) pointed out that the space density is under- 
estimated by a factor of ~ 2 for the fit of Richards et 
al. (2006). Following White, Martini & Cohn (2008) we have 
used that correction factor in our analysis above. For com- 
parison, we also repeat our analysis using the uncorrected 
value of N = 0.35 x 10~ 7 Mpc" 3 . 

We therefore compare a set of four calculations, using 
combinations of the Sheth-Tormen or Press-Schechter halo 
statistics with the quasar density computed with or without 
the correction factor. In Figures r4l7l we show probability dis- 
tributions for the three additional cases together with our 
fiducial model. The constraints on 7 and a are unaffected 
by these model changes since these are not directly related 
to the numbers of observed quasars. The lower density of 
quasars and the larger predicted bias of the Press-Schechter 
formalism (and their combination) each yield weaker con- 
straints on A, F than our fiducial model. However the sys- 
tematic effect is mild, with limits of A < 1.1 (85%), and 
F > 1.1 (98%). We find that the minimum halo mass is 
effected by a factor of 2 depending on the assumed combi- 
nation. 



with corresponding cumulative distribution 

r°° <ip 

P«M) = / Md M'_ . (19) 

We plot these distributions in Figure [7] The host halo 
mass for the quasars in the high redshift sample is M ~ 
10 13±0 ' 5 M Q . In our fiducial model, the halo mass is re- 
stricted to be larger than ~3x 1O 12 M (99%). This mass 
range is comparable to the one (M > 5 x 1O 12 M ) quoted 
by Shen et al. (2007). However, note that here M refers to 
the halo mass corresponding to L at the mean of the L-M 
relation. The possibility of a large intrinsic scatter means 
that many host halos will have a significantly smaller mass. 

3.5 Theoretical uncertainties in the bias 

Thus far we have presented a fiducial model using the Sheth- 
Tormen forms for the clustering bias (Sheth, Mo & Tor- 
men 1999) and the mass-function (Sheth & Tormen 1999). 
However as demonstrated in White, Martini & Cohn (2008), 
the bias and density are sufficiently large as to make any 
quantitative conclusions sensitive to the detailed predictions 
of the Sheth-Tormen model. A discussion of the various an- 
alytic models and their comparison to numerical simula- 
tion is provided by White, Martini & Cohn (2008), who 
suggest that the Press-Schechter and Sheth-Tormen for- 
malisms should bracket the expected level of theoretical un- 
certainty. Following this assertion, we also present results 
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4 DISCUSSION 



We find that the value of F is constrained to be in ex- 
cess of unity, i.e. the observed bias is in excess of the halo 
bias. This could be interpreted as evidence that mergers 
are responsible for the quasar activity. We also find that 
7 > 1, implying that the luminosity to halo mass ratio 
increases with halo mass. Under the assumption that the 
Eddington ratio of bright quasars is nearly constant (as 
argued by Kollmeier et al. 2006), models of feedback lim- 
ited growth through energy deposition (Haehnelt, Natara- 
jan & Rees 1998; Wyithe & Loeb 2003) predict a value of 
7 = 5/3. If we adopt 7 = 5/3, then the preferred range 
of F (marginalised over other parameters) is 1.5 < F < 2. 
Interestingly, F ~ 1.5 is approximately the factor by which 
Furlanetto & Kamionkawski (2006) estimated mergers to in- 
crease the observed bias over the value of halo bias at high 
values of M. In our analysis we have allowed F to be a free 
parameter. This approach has the promise of constraining 
the merger process responsible for the triggering of quasar 
activity. For example, Furlanetto & Kamionkawski (2006) 
demonstrated that if close pairs lead to mergers, then merg- 
ers are expected to be clustered differently than individual 
halos. The magnitude of this difference is dependent on the 
clustering model, and so its measurement will constrain fun- 
damental properties of the merger process. 

Several anomalies have been previously noticed in the 
unexpectedly high clustering of systems that may be merger 
driven. For example, the clustering amplitude of Lyman- 
break galaxies (Adelberger et al. 2005) implies a host halo 
mass of ~ 1O 12 M0. However at this mass, the density of 
Lyman-break galaxies then implies that that the duty-cycle 
of vigorous star formation must be unity in all halos. More- 
over, kinematic measurements of these galaxies imply a sig- 
nificantly smaller mass of ~ W^Mq (Pettini et al. 2001; 
Erb et al. 2003). The tension between the different esti- 
mates of host mass could be alleviated if the intense star 
formation in Lyman-break galaxies is triggered by mergers, 
and if mergers in turn increase the observed bias (Furlan- 
etto & Kamionkowski 2006). In a recent paper, Wake et 
al. (2008) demonstrated that a radio-loud sub-sample of Lu- 
minous Red Galaxies at z ~ 0.55 had a clustering amplitude 
that is in excess of the radio-quite sub-sample, despite the 
optical properties being identical. Wake et al. (2008) inter- 
pret this as a larger host halo mass for radio loud objects. 
However, it might also be appealing to assign the excess bias 
to a merger origin of accretion activity. 

Of course there are alternative explanations for each of 
these observations. For example, dynamical measurements 
may only be accessing the central component of the host 
halos of Lyman break galaxies (Cooray 2005). In the case 
of radio loud Luminous Red Galaxies, the high bias could 
originate from the high bias of X-ray clusters (more massive 
halos), of which the Luminous Red Galaxies would be the 
central member. Additionally, since clustering statistics have 
been shown to depend on the halo formation history (Gao, 
Springel & White 2005; Croton, Gao & White 2007; Wetzel 
et al. 2007) , the age or structural properties the galaxy host 
could also effect the observed clustering. However in each 
case it is clear that the possibility of modification of the 
observed bias should be taken into account in interpreting 




0.01 



duty 



Figure 8. The theoretically expected value of a n 



as a func- 



tion of quasar duty cycle (see text for details) . Curves are plotted 
for four values of 7 = 1, 4/3, 5/3, and 2 (bottom to top). 



clustering data for many types of galaxies, such as starburst 
galaxies or radio galaxies. 

The value of a is also well constrained. Since the con- 
straint of F > 1 may suggest that quasar activity is triggered 
by mergers, it is interesting to calculate the predicted value 
of a using a model where the total duty cycle (i.e. summing 
over all quasar episodes in a single host) as a function of 
luminosity is proportional to the rate of major mergers. We 
use the extended Press-Schechter formalism to evaluate the 
rate of mergers of halos with mass Mi per unit mass with 
halos of mass M 2 , d 2 N/dM 1 dt\ (Lacey & Cole 1993). The 



M 2 

duty-cycle is then proportional to Mi d 2 N/dM\dt\ M ti t H, 
where t\ t is the lifetime of a single merger driven quasar 
episode and H~ l is the Hubble time at the redshift of in- 
terest (Wyithe & Loeb 2003). Assuming that ti t is not a 
function of mass (as expected if the quasar episode is re- 
lated to timescales like the black-hole doubling time or the 
dynamical time of its host galaxy), we find 

1 d 



7 d log M 



r, d2N 




log M -TTTT 


J] 


5 V dMdt 





(20) 



where we have assumed major mergers with Mi ~ M2 = M, 
and the term I/7 originates in the conversion from (flog A/ 
to cilogL for consistency with the definition of a. The re- 
sulting value of Qmerge as a function of duty cycle is plotted 
in Figure [8] for four values of 7 = 1, 4/3, 5/3, and 2 (bot- 
tom to top). While a value of 7 = 5/3 applies to feedback 
through energy deposition, models where the quasar drives 
the gas outflow from its host galaxy through momentum 
deposition have 7 = 4/3 (e.g. Silk & Rees 1998; Murray, 
Quataert & Thompson 2005; King 2003). Here, the value 
of ftmorgc has been computed at a halo mass for which the 
number density times the duty cycle is equal to the observed 
quasar density. We find that for this model we would expect 
a value of — 1 < awrgc ;$ —0.3, with larger values of 7 in- 
dicating less dependence of duty cycle on luminosity. These 
values are consistent with our constraints on a. 

We note that the scatter is only constrained to satisfy 
A < 1 dex. This range is significantly larger than the tight 
upper limit of ~ 0.3 dex found in the analysis of White et 
al. (2008). The difference can be traced to the greater free- 
dom in our model associated with: (i) the relation between 
luminosity and halo mass through 7; (ii) the relation be- 
tween quasar density and halo mass through a; and (in) the 
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allowance for the observed clustering bias to not directly re- 
flect the underlying host mass for merger-driven systems. 
We do find that models assuming F = 1, 7 = 1 (as con- 
sidered by White, Martini & Cohn 2008) reflect the small 
upper limit of A ~ 0.3 dex. However in addition to this 
low value of scatter being disfavoured in our more general 
model by the joint constraints from clustering and luminos- 
ity function, we would not expect such a tight relationship 
on observational grounds. In particular, an upper limit of 
0.3 dex implies a scatter that is smaller than the scatter in 
the black hole-velocity dispersion relation measured locally 
(~ 0.3 dex; Tremaine et al. 2002). However the relation be- 
tween quasar luminosity and halo mass must have several 
additional sources of scatter, including in the relation be- 
tween halo mass and bulge velocity dispersion, and between 
black hole mass and quasar luminosity. Hopkins et al. (2007) 
have investigated the scatter in relations between black-hole 
mass and galaxy properties including the bulge velocity dis- 
persion. They argue that the tightness of these relations im- 
plies that any connection between black-hole mass and halo 
mass must be incidental. Wyithe & Loeb (2005) have mod- 
eled the black-hole - halo mass relation, and have shown 
that the scatter in the black-hole velocity dispersion rela- 
tion combined with the statistical properties of dark-matter 
halos (Bullock et al. 2001) implies a scatter of at least 0.5 
dex at the redshifts of interest. Moreover, any relation be- 
tween luminosity and halo mass must also include a factor 
to account for the fraction of the Eddington rate at which 
the quasar accretes. Kollmeier et al. (2006) found a scatter 
in this distribution of 0.3dex. 

Finally, we note that the quasar duty-cycle is not con- 
strained in this model. This is in difference to previous re- 
sults (Haiman & Hui 2001; Martini & Weinberg 2001; Shen 
et al 2007; White et al. 2008), which have constrained the 
quasar lifetime from clustering. The difference originates 
from the fact that our model does not assume the one to 
one correspondence between observed clustering amplitude 
and halo mass (F — 1). This generalisation, motivated by 
the theoretical expectation that merging systems are likely 
to be more clustered than their isolated counterparts at fixed 
halo mass, means that a large clustering amplitude can be 
obtained by more common, lower mass halos, allowing the 
observed density to be achieved with a smaller duty-cycle. 
Moreover, Figure [3] demonstrates that even if a value of 
F ~ 1.5 (Furlanetto & Kamionkowski 2006) is adopted, the 
duty-cycle still remains unconstrained. This is because the 
large value of F allows for reproduction of the data over a 
wide range of possible values for the scatter (A), which in 
turn allows the model to predict a broad range of values for 
the halo abundance. 

In our analysis we have included the slope of the quasar 
luminosity function as an additional constraint to the quasar 
density and clustering. Before concluding we note that the 
addition of this constraint does not effect quantitative con- 
clusions regarding F or A, which are derived from compar- 
ison of the halo number density and bias to the observed 
density and bias. We have repeated our analysis without the 
constraint on the luminosity function slope. We find that the 
addition of this constraint allows a limit to be placed on a 
and tightens the constraint on 7. 



5 CONCLUSION 

The large amplitude of the observed clustering among 2 ~ 4 
quasars, combined with their density and luminosity func- 
tion slope, strongly constrains the relationship between 
quasar luminosity and halo mass at high redshifts. We 
have modelled these observables using the extended Press- 
Schechter formalism, combined with a mean quasar lumi- 
nosity (L) - halo mass (M) relation of the form L oc M 7 . 
We assume an intrinsic scatter (in dex) around the mean re- 
lation of A at a fixed halo mass. We find that the observed 
clustering amplitude and luminosity function slope cannot 
be simultaneously reproduced unless both of the following 
conditions hold: 

• The value of 7 is greater than unity, implying that 
quasar luminosity is not a linear function of host mass. 

• The observed clustering amplitude is in excess of that 
expected from dark matter halos (F > 1), possibly implying 
that the observed bias is boosted because mergers trigger 
quasar activity. 

The latter constraint on F is particularly strong. We 
find F > 1 .1 at the 99% level. On the other hand, we find 
that the scatter in the L — M relation can be constrained 
only to a value smaller than ~ I dex (in difference from the 
recent study of White et al. 2008). We find that because of 
the weak constraint on the scatter the mean host mass can 
be only weakly constrained, to within an order of magnitude 
around ~ 10 13 Mq. Importantly, and for the same reason, 
we find that clustering data combined with quasar densities 
does not constrain the quasar lifetime, as had been suggested 
previously (Martini & Weinberg 2001; Haiman & Hui 2001). 

Interestingly, the literature already contains a physi- 
cally motivated model which is consistent with all of the 
above constraints. A scenario in which galaxy mergers drive 
accretion onto a central black hole, which then radiates near 
its Eddington luminosity until it grows sufficiently to deposit 
(with a ~ 5% efficiency) the binding energy of the surround- 
ing gas over a dynamical time, yields an L—M relation with 
a value of 7 = 5/3 (Wyithe & Loeb 2003). For this value 
of 7, our constraints favor a value of 1 .5 < F < 2, which 
is consistent with estimates of the expected enhancement in 
clustering bias for merger driven activity of F ~ 1.5 (Furlan- 
etto & Kamionkawski 2006). 

In summary, the clustering statistics combined with the 
luminosity function of high redshift quasars, provide com- 
pelling evidence for a scenario of merger-driven quasar ac- 
tivity, with black-hole growth that is limited by feedback. 
The resulting luminosity-halo mass relation implies that the 
luminosity to halo mass ratio increases with halo mass as ex- 
pected in theoretical models that include feedback limited 
black hole growth. 
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